Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection
نویسندگان
چکیده
Real time magnetic resonance imaging (rtMRI) enables direct video capture of the moving vocal tract concurrent with audio signal providing valuable data for speech research. We consider a multimodal approach to voice activity detection (VAD) in the rtMRI recording that uses audio signal as well as MRI image sequence. The degraded quality of the audio recorded in the scanner motivates this multimodal scheme for robust VAD. Optimal regions in the MRI image are selected for performing VAD with a novel algorithm. VAD experiments using rtMRI data of two male and two female subjects show that VAD performance using optimally selected regions from MRI images is comparable to that using only audio signal. The optimal regions turn out to be parts of jaw, velum, glottis and lips. VAD performance using audio signal and MRI image sequence together is found to be significantly better (∼14% absolute improvement in VAD accuracy) than that using the audio only when the audio is contaminated with additive noise at low SNR.
منابع مشابه
Design and Construction of a Head Probe Coil for Vocal Tract Imaging
Magnetic resonance imaging (MRI) is nowadays a most widely use in medicine for diagnostic imaging and in research studies. At the present time many research studies follow with problematic about human vocal tract modeling. This paper is devoted to the design and optimization of head probe coil for vocal tract imaging. For the reason of requirement voice recording during measurement, this head p...
متن کاملReal-time magnetic resonance imaging investigation of resonance tuning in soprano singing.
This article investigates using real-time magnetic resonance imaging the vocal tract shaping of 5 soprano singers during the production of two-octave scales of sung vowels. A systematic shift of the first vocal tract resonance frequency with respect to the fundamental is shown to exist for high vowels across all subjects. No consistent systematic effect on the vocal tract resonance could be sho...
متن کاملUsing functional magnetic resonance imaging (fMRI) to explore brain function: cortical representations of language critical areas
Pre-operative determination of the dominant hemisphere for speech and speech associated sensory and motor regions has been of great interest for the neurological surgeons. This dilemma has been of at most importance, but difficult to achieve, requiring either invasive (Wada test) or non-invasive methods (Brain Mapping). In the present study we have employed functional Magnetic Resonance Imaging...
متن کاملUsing functional magnetic resonance imaging (fMRI) to explore brain function: cortical representations of language critical areas
Pre-operative determination of the dominant hemisphere for speech and speech associated sensory and motor regions has been of great interest for the neurological surgeons. This dilemma has been of at most importance, but difficult to achieve, requiring either invasive (Wada test) or non-invasive methods (Brain Mapping). In the present study we have employed functional Magnetic Resonance Imaging...
متن کاملBrain Activity Map Extraction of Neuromyelitis Optica Patients Using Resting-State fMRI Data Based on Amplitude of Low Frequency Fluctuations and Regional Homogeneity Analysis
Introduction: Neuromyelitis Optica (NMO) is a rare inflammatory disease of the central nervous system which generally affecting the spinal cord and optic nerve. Damage to the optic nerve can result in the patient's dim vision or even blindness, while the spinal cord damage may lead to sensory and motor paralysis and the weakness of the lower limbs in the patient. Magnetic Reson...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014